Model Quantization, ONNX Runtime, Embedded Inference, TinyML

Edge AI: The future of AI inference is smarter local compute
infoworld.com·2d
AI-Driven DevOps
Everything Moe
ianbarber.blog·1d·
Discuss: Hacker News
🧱Chunking
Why I Moved My ML Model from Flask to AWS Lambda (A Student’s Guide to $0 Hosting)
dev.to·1h·
Discuss: DEV
📉Model Quantization
AI Systems Performance Engineering
github.com·4h·
Discuss: Hacker News
Performance Engineering
Deep learning as program synthesis
lesswrong.com·1d·
🖼️Dual Coding
AI Tools Race Heats Up: Week of January 13-19, 2026
dremio.com·2d·
Discuss: DEV
AI-Driven DevOps
a transport layer for agentic apps
ably.com·8h·
Discuss: Hacker News
💬Prompt Engineering
Qdrant - Vector Database
qdrant.tech·1d
🗂️Vector Databases
MIT’s new ‘recursive’ framework lets LLMs process 10 million tokens without context rot
venturebeat.com·1d·
💸Affordable LLMs
YC Spring – Full-Stack AI Consulting Company
news.ycombinator.com·14h·
Discuss: Hacker News
AI-Driven DevOps
How to Break Any AI Model (A Machine Learning Security Crash Course)
dev.to·13h·
Discuss: DEV
🛡️AI Security
Co-optimization Approaches For Reliable and Efficient AI Acceleration (Peking University et al.)
semiengineering.com·12h
🚀Performance
Why AI Needs GPUs and TPUs: The Hardware Behind LLMs
blog.bytebytego.com·2d
🚀Performance
Ensemble Listening Model (ELM): State-of-the Art Foundation Model Accuracy. A Fraction of the Cost.
ensemblelisteningmodel.com·1d·
Discuss: Hacker News
📉Model Quantization
Finally! Proof That Agentic AI Scales (For Creating Broken Software)
codemanship.wordpress.com·20h·
Discuss: Hacker News
AI-Driven DevOps
Momory: AI Real-Time Stream Subtitles and Translation
momory.dev·1h·
Discuss: Hacker News
📹WebRTC
Field Notes on Scaling MoE Expert Parallelism with DeepEP
nousresearch.com·1d·
🚀Performance
What AI Accountability Looks Like (I Built It)
forgeforward.substack.com·10h·
Discuss: Substack
AI Ethics & Alignment
Show HN: Upgrade from Ralph to Eric for a more autonomous AI
dbuild.dev·23h·
Discuss: Hacker News
💬AI Code Assistants
The Convolutional Neural Network
cocakoala.substack.com·3d·
Discuss: Substack
🧱Chunking
